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Abstract. We propose an extension of the join calculus with pattern matching on alge- 
braic data types. Our initial motivation is twofold: to provide an intuitive semantics of the 
interaction between concurrency and pattern matching; to define a practical compilation 
scheme from extended join definitions into ordinary ones plus ML pattern matching. To 
assess the correctness of our compilation scheme, we develop a theory of the applied join 
calculus, a calculus with value passing and value matching. We implement this calculus 
as an extension of the current JoCaml system. 



The join calculus jl51 I16j is a process calculus in the tradition of the vr-calculus of 
Milner et al. One distinctive feature of join calculus is the simultaneous definition of 
all receptors on several channels through join definitions. A join definition is structured 
as a list of reaction rules, with each reaction rule being a pair of one join pattern and one 
guarded process. A join pattern is in turn a list of channel names (with formal arguments), 
specifying the synchronization among those channels: namely, a join pattern is matched 
only if there are messages present on all its channels. Finally, the reaction rules of one 
join definition define competing behaviors with a non-deterministic choice of which guarded 
process to trigger when several join patterns are satisfied. 

In this paper, we extend the matching mechanism of join patterns, such that message 
contents are also taken into account. As an example, let us consider the following list-based 
implementation of a concurrent stack0 

def pop{r) & State{x: :xs) r{x) & State{xs) 
or push(v) & State{ls) t> State (v. -.Is) 
in 5tate([]) k ... 

1998 ACM Subject Classification: D.1.3, D.3.3, F.3.2. 

Key words and phrases: join-calculus, pattern-matching, process calculus, concurrency. 
* Extended version of 1281 . 
^We use the OCaml syntax for lists, with Nil being [] and Cons being the infix : : . 
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The second join pattern push{v) & State{ls) is an ordinary one: it is matched whenever 
there are messages on both State and push. By contrast, the first join pattern is an extended 
one, where the formal argument of channel State is an algebraic pattern, matched only by 
messages that are cons cells. Thus, when the stack is empty (i.e., when message [] is 
pending on channel State), pop requests are delayed. Note that we follow the convention 
that capitalized channels are private: only push and pop will be visible outside. 

A similar stack can be implemented without using extended join patterns, but instead, 
using an extra private channel and ML pattern matching in guarded processes: 
def pop{r) & Some{ls) \> match Is w^ith 

I [x] —5- r[x) & EmptyO 
I y: :x: :xs — > r{y) & Some{x: ixs) 
or push{v) & EmptyO ^ Some ( [w] ) 
or push{v) & Some{ls) > Some {v: -.Is) 
in Empty & . . . 

This second definition encodes the empty /non-empty status of the stack as a message on 
channels Empty and Some respectively. Pop requests on an empty stack are still delayed, 
since there is no rule for the join pattern pop{r) &: Empty {). The second definition obviously 
requires more programming effort. Moreover, it is not immediately apparent that messages 
on Some are non-empty lists, and that the partial ML pattern matching thus never fails. 

Join definitions with (constant) pattern arguments appear informally in functional 
nets |36| . Here we generalize this idea to full algebraic patterns. A similar attempt has also 
been scheduled by Benton et al. as an interesting future work for Cw jZj. 

The new semantics is a smooth extension, since both join pattern matching and pattern 
matching rest upon classical substitution (or semi- unification) . However, an efficient im- 
plementation is more involved. Our idea is to address this issue by transforming programs 
whose definitions contain extended join patterns into equivalent programs whose definitions 
use ordinary join patterns and whose guarded processes use ML pattern matching. Doing 
so, we leave most of the burden of pattern matching compilation to an ordinary ML pattern 
matching compiler. However, such a transformation is far from obvious. More specifically, 
there is a gap between (extended) join pattern matching, which is non-deterministic, and ML 
pattern matching, which is deterministic (following the "first match policy"). For example, 
in our definition of a concurrent stack with extended join patterns, State{ls) is still matched 
by any message on State, regardless of the presence of the more precise State{x : :xs) in the 
competing reaction rule that precedes it. Our solution to this problem relies on partitioning 
matching values into non-intersecting sets. In the case of our concurrent stack, those sets 
simply are the singleton { [] } and the set of non-empty lists. Then, pattern State{ls) is 
matched by values from both sets, while pattern State{x : : xs) is matched only by values of 
the second set. 

The rest of the paper is organized as follows: Section first gives a brief review of al- 
gebraic patterns and ML pattern matching. Section |31 presents the applied join calculus — 
an extension of join with algebraic pattern matching. We introduce the semantics and the 
appropriate equivalence relations. Section informally explains the key ideas to transform 
the extension to the ordinary join calculus, and especially how we deal with the nonde- 
terminism problem. Section [S] formalizes the transformation as a compilation scheme and 
presents the algorithm which essentially works by building a meet semi-lattice of patterns. 
We go through a complete example in Section |H1 and finally, we deal with the correctness of 
the compilation scheme in Section [7| Implementation has been carried out as an extension 
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of the JoCaml system. We discuss the issues that have arisen during the implementation 
work in Section |H1 

An earlier version of this paper (lacking the detailed proofs and the discussion of the 
implementation) appeared as j28j . 

2. Algebraic data types and ML pattern matching 

This section serves as a brief introduction to algebraic data types and ML pattern 
matching. Interested readers are referred to I26j for further details. 

2.1. Algebraic data types. In functional languages, new types can be introduced by using 
data type definitions and such types are algebraic data types. For example, using OCaml 
syntax, binary trees can be defined as follows: 

type tree = Empty | Leaf of int | Node of tree * tree 
The complete signature of type tree has three constructors: Empty, Leaf , and Node, which 
are used to build the values of this type. Every constructor has an arity, i.e. the number of 
arguments it requires and meanwhile specifies the corresponding types of each argument. In 
this definition, Empty is of arity zero. Leaf is of arity one (and accepts integer arguments), 
and Node is of arity two (both its arguments being themselves of type tree). A constructor 
of zero arity is sometimes called a constant constructor. 

Most native ML data types can be seen as particular instances of algebraic data types. 
For example, lists are defined by two constructors: constant Nil (written [] ) for empty lists 
and Cons (written as the infix : : ) for nonempty ones; pairs are defined by one constructor 
with arity two, (written as the infix ","); and integers are defined by infinitely many (or 
2^^) constant constructors. 

Formally, the algebraic values (for short values) of type t are well-typed terms built 
from the constructors of t. "Well-typed" here means correct with respect to constructor 
arity and argument types. Assuming a countable set of identifiers for constructors, ranged 
over by k, we give the formal definition of values as follows: 

V ::= Algebraic values 

^(^^1) ■'^25 ■ ■ ■ , Vn) 1^ of arity n > 

Type correctness is left implicit: we shall consider well typed terms only. 

Algebraic patterns (for short patterns) of type t are also well-typed terms built from the 
constructors of t, but with variables^ The formal definition of patterns is given as follows. 

TT ::= Algebraic patterns 

X variable 

fi;(7ri, 7r2, . . . , 7r„) k of arity n > 

We further require all variables in a pattern to be pairwise distinct, that is, we only consider 
linear patterns. 

Again, we assume a typed context. More precisely, we rely on the ML type system to 
guarantee that values and patterns are well-typed. Moreover, we rely on a ML type inferer 
to enrich syntax with explicit types (which we leave implicit), and consider that the type of 
any syntactic structure is available whenever needed. Doing so, we focus on our main issue 
and avoid complications that would be of little explanatory value. 



'We freely replace variables whose names are of no importance by wildcards "_" . 
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Patterns are used to discriminate values according to their structures. More specifically, 
a pattern denotes a set of values that have a common prefix specified by the pattern. We 
say a value v (of type t) is an instance of pattern vr (of type t), or that v matches vr, when 
vr describes the prefix of v, in other words, when there exists a substitution a, such that 
vro" = V. For linear patterns, the instance relation can be defined inductively as follows: 

Definition 2.1 (Instance). Let vr be a pattern and f be a value, such that vr and v have 
the same type, the instance relation vr ^ u is defined as: 

K(vri, . . . , vr„) ^ k{vi, . . . , Vn) iff VTj ^ Vi for all 1 < i < n 

We write Ins(vr) for the set of the instances of pattern vr. The instance relation induces the 
following relations among patterns. These relations apply to patterns vri and vr2 that have 
the same type. 

Definition 2.2 (Pattern relations). 

• Patterns vri and vr2 are compatible when they share at least one instance. Otherwise vri 
and vr2 are incompatible written 7^14^^712- Two compatible patterns admit a least upper 
bound written vri | vr2, whose instance set is Ins(vri) n Ins(vr2). 

• Pattern vri is less precise than pattern vr2 written vri ^ when Ins(vr2) C lns(7ri). 

• Patterns vri and vr2 are equivalent written vri = when lns(7ri) = Ins(vr2). If so, their 
least upper bound is their representative, written vr^ | vr2. 

Note that we use the same notation < for both relations: "being an instance of" (which is 
between a pattern and a value) and "being less precise" (which is between two patterns). 
Indeed, values are in fact a special case of patterns (with no variables), and in that case, 
both relations collapse. 

The least upper bound of two patterns can be computed at the same time when com- 
patibility is checked by the following rules: 

_ I vr = vr 
vr |_ = vr 

K(vri, . . . ,vr„) t K;(tJi, . . . ,a;„) = K(vri j wi, . . . , vr„ t "^n) 

Deciding the relation "being less precise" is more involved. Because of typing, there 
exists nontrivial such relations, for instance (_,_) ^ _. The JoCaml compiler relies on 
an efficient algorithm for this task, called the lA algorithm, with lA standing for "Useful- 
ness" |3Uj . Algorithm lA takes two parameters: a list of patterns 11 and a pattern vr, and 
returns a boolean. Roughly speaking, it checks the usefulness of vr with respect to 11. More 
specifically, algorithm U tests the existence of at least one value v such that vr admits v as 
an instance, and none of the patterns in 11 does. 

From the point of view of algorithm lA, deciding the relation vri ^ amounts to 
compute the negation of Z//([vri], vr2). Namely, vri is less precise then vr2, if and only if all the 
instances of vr2 are instances of vri • 

vri ^ ^{[111], 1:2) = useless 

We now give a simplified definition of algorithm lA. The simplified definition suffices for our 
needs and also conveys the basic idea behind the algorithm. 

Consider Z//([vri], vr2), where vri and vr2 are patterns of a common type t. The following 
two cases are distinguished. 
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Case 7r2 = k{u!i, . . . , Wn) 

• If TTi = k(7i, . . . , 7n), then check if 3i,l < i < n, s.t. Z/^([7i], cjj). 

• If TTi = k'(7i, . . . ,7n) and k ^ n' , then useful (i.e. false for vri ^ 7r2). 

• If vT]^ = _, then useless {i.e. true for tti ^ 7r2). 

Case 1^2 = _ 

• If TTi = _, then useless (i.e. true for vri ^ '7r2). 

• If TTi = k(7i, . . . ,7n), 

— if K is the unique constructor of type t, then check if 3i, 1 < i < n, s.t. U{[yi],_)- 

— otherwise useful (i.e. false for vri ^ 7r2)- 

Once we can decide relation we can easily decide pattern equivalence, since, by defi- 
nition, TTi = TT2 means tti ^ 7r2 and 7r2 ^ vri. 

2.2. ML pattern matching. In ML, operating on algebraic data types is performed by 
the use of the following match construct that we extend to processes {Qi, Q2 etc. below 
are processes of the join calculus). 

match V with vri ^ Qi | vr2 — >• (52 | ■ • • I 7i"n — ^ Qn 
Above, we attempt a matching of value v against a sequence of patterns vri, . . . , vr„ of the 
same type. 

ML pattern matching is deterministic. It follows the "first match policy". That is, 
when value v is an instance of more than one of the patterns vTj, the match construct 
chooses the one with the smallest index i. This can be seen as checking patterns vri, vr2, 
. . . , Tin for admitting v as an instance sequentially, stopping as soon as a match is found. 
As a consequence, pattern vr, is matched only by the values in set Ins(vrj) \ (|J-|^<^.^jlns(vrj)). 
Moreover, patterns in ML pattern matching also act as binding constructs. Once a successful 
match is found, say vr^ ■< v, the variables in vr^ are all bound to the corresponding subterms 
of V in the guarded process Qk- 

Additionally, we say a match construct is exhaustive when Ui<j<„lns(vrj) is the whole 
set of values of the considered type. We accept non-exhaustive match constructs. 

3. The applied join calculus 

We define the applied join calculus by analogy with the applied vr-calculus p. The 
applied join calculus inherits its capabilities of communication and concurrency from pure 
join. Moreover it supports algebraic value passing and algebraic pattern matching in both 
join patterns and processes. 

3.1. Syntax and scopes. The syntax of the applied join calculus is given in Figure ^ 
As it is customary in process calculi definitions, we assume an infinite set of identifiers for 
variables, ranged over by x,y,z. 

With respect to pure join calculus, two new syntactic categories are introduced: expres- 
sions and patterns. At first glance, both expressions e and patterns vr are terms constructed 
from variables and constructors, where n stands for the arity of constructor k. We make 
them different syntactic categories for clarity, and also because we require patterns to be 
linear. We also formalize the ML pattern matching in processes, as the new match con- 
struct. Moreover, in contrast to ordinary name passing join calculus, there are two other. 
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p 




Processes 







null process 




x{e) 


message sending 




PkP 


parallel 




def D in P 


definition 




match e with tti ^ Pi . 


• • T^m Pm pattern matching 


D 


••= 


Join definitions 




T 


empty definition 




Jt>P 


reaction 




Dor D 


disjunction 


J 




Join patterns 




x{tt) 


message pattern 




JkJ 


synchronization 


vr 




Algebraic patterns 




X 


variable 




K(7ri,7r2, . . . ,7r„) 


constructor pattern 


e 




Expressions 




a; 


variable 




K(ei,e2, . . . ,en) 


constructor expression 



Figure 1: Syntax of the applied join calculus 



more radical, extensions: first, in message sending, message contents become expressions 
as x(e), that is, we have value passing; second, when a channel name is defined in a join 
pattern, in addition to the synchronization requirement, we also specify what pattern the 
message content should satisfy by x{7r). 

There are two kinds of bindings: the definition process def D in P binds all the channel 
names defined in D (written dv[Z)]) with scope P; and the reaction rule J o P or the ML 
pattern matching match e with tti — > Pi | . . . | Tim Pm bind all the local variables 
(written rv[J] or rv[7rj]) with scope P or Pj, i € {1, • • • jiri}. The definition of the sets of 
defined channel names dv[-] is the same as in pure join. By contrast, the definition of sets 
rv[-] has to be extended, so as to take pattern arguments into account. Meanwhile, the 
definition of sets fv[-] should also be extended, to cater for the new match process and 
expressions. We present the formal definitions of dv[-], rv[-], and fv[-] in Figure |2l In these 
rules, l±) is the disjoint union, which expresses linearity constraints on both algebraic and 
join patterns. 

In applied join, values become of two kinds: channel names or algebraic values. We 
assume a type discipline in the style of the type system of the join-calculus JH]) extended 
with algebraic data types and the rule for ML pattern matching. Without making the 
type discipline more explicit, we consider only well- typed terms (whose type we know), 
and assume that substitutions preserve types. It should be observed that tuples are now 
represented as a kind of constructed expressions and the arity checking of polyadic join 
calculus is now replaced by a well- typing assumption in applied join, which is thus monadic. 
One important consequence of typing is that any (free) variable in a term possesses a type 
and that we know this type. Hence, we can discriminate between those variables that are 
of a type of constructed values and those that are of channel type. Following the semantics 
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For algebraic patterns : 



rv[x] 


QCI 


{x} 


rv[K(7ri, 7r2, . . . , 7r„)] 


dcf 


r\/\'Tr^] l4-l r\/\'Tr^] l4-l l4-l r\ 

1 v[7rij ty 1 v[7r2j ttJ . . . ty r 


For expressions : 






fv[x] 


def 


{x} 


fv[K(ei,e2, . . . ,en)] 


dcf 


fv[ei] Ufv[e2] U . . . Ufv 


For join patterns : 






rv[a;(7r)] 


def 






rv[7rj 


rv[Ji & J2] 


dcf 


rv[ Jil tt) rv[ J2I 

L J- J L ^ J 


Ov[X[TT)\ 


dcf 


|x| 


rlitl To T 1 


dcf 


QV[JiJ l±l uV[J2j 


For join definitions : 






dv[T] 


def 


rA 




dv[J ;> P] 


dcf 


dv[J] 


av[iJi or D2\ 


dcf 


dv^L'ij u dv[L'2j 


fv[T] 


dcf 





fv[J;>P] 


dcf 


dv[J]u(fv[P]\rv[J]) 


W[Di or D2] 


dcf 


fv[i?i] ufv[Z)2] 


For processes : 






fv[0] 


dcf 





fv[2;(e)] 


dcf 


{x} U fv[e] 


fv[Pi&P2] 


dcf 


fv[Pi]Ufv[P2] 


fv[def D in P] 


dcf 


{W[D]uW[P])\dv[D] 


fv[match e with|*^^ vTj — > Pj] 


dcf 


fv[e]u(Ue/fv[^'d\rv 


For solutions : 






6y[V] 


dcf 




fv[P] 


dcf 




W[V] 


dcf 





Figure 2: Bindings and scopes in the applied join calculus 
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Str-Null h h 

Str-Par hPi&Pa ^ HPi,P2 

Str-Top T K ^ h 

Str-Def h def D in P ^ h P 

React J > P or D \- Ja — > J > P or D \- Pa 

Match h match tTj?] with tti — > Pi | . . . | iTm Pm 

^Pii] 



Side conditions: 

Str-Def 

React 

Match 



dv[P)] is fresh 

a substitutes closed expressions for TV [J] 
■q substitutes closed expressions for rv[7rj] 
and Vj < i, VTj nir] 



Figure 3: RCHAM of the applied join calculus 



of name passing calculi such as join, we treat the latter kind of variables as channel names, 
that is, values. While, in any reasonable semantics, the former kind of variables cannot be 
treated so. We call a term variable- closed {closed for short) when its free variables are all 
of channel type, and otherwise open. 



3.2. Chemical semantics. We establish the semantics following the reflexive chemical 
abstract machine (RCHAM) style — the reflexive variant of CHAM |H], whose states are 
chemical solutions. A chemical solution is a pair D \- V, where P is a multiset of (active) join 
definitions, and "P is a multiset of (running) processes. Extending the notion of closeness 
to solutions in the member-wise manner, we say a solution is closed when all its active 
join definitions and running processes are closed, namely, free variables are all of channel 
type. We define semantics only on closed solutions. The chemical rewriting rules are given 
in Figure El consisting of two kinds as in join: structural rules ^ or ^ represent the 
syntactical rearrangement of the terms, and reduction rules — > represent the computation 
steps. We follow the convention to omit the part of the solution that remains unchanged 
during rewrite. This can also be expressed by the following context rule: 

Context 

VohVo ^ Vih Pi 

V,Vo^ro,V ^ P,PihPi,P 

where — > stands for either ^ or — >, and T> and P are the independent context of the 

considered subsolution. Rule Str-Def is a bit of exception because its side condition ac- 
tually requires the following relationship hold between the rewriting part and its context: 

dv[P>] n (fv[P] ufv[P]) = 0. 

Finally, it is perhaps to be noticed that, amongst the various, slightly different, seman- 
tics of join-machines, we extend the one of ^Hl) which is adapted to static typing. This 
means that we need to state explicitly that or is an associative-commutative operator. As 
a consequence, the notation Jt>PorDm rule React stands for a definition that possesses 
a reaction rule whose pattern is J. 
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Matching of message contents against formal pattern arguments is integrated in the 
substitution a in rule React. As a consequence this rule does not formally change with 
respect to ordinary join calculus. However its semantical power has much increased. The 
Match rule is new and expresses ML pattern matching. Its side condition enforces the first 
match policy. 

According to the convention of processes as solutions, namely P as h P, the semantics 
is also defined on closed processes in the following sense. 

Definition 3.1. Let ^* denote the transitive closure of ^ U 



3.3. Equivalence relation. In this section, we equip the applied join calculus with equiv- 
alence relations to allow reasoning over processes. The classical notion of barbed congruence 
is a sensible behavioral equivalence based on a reduction semantics and barb predicates. It 
was initially proposed by Milner and Sangiorgi for CCS j34j. and adapted to many other 
process calculi [22113], including the join calculus. We take weak barbed congruence |33 as 
our basic notion of behavioral equivalence for closed processes. 

3.3.1. Observational equivalence for closed processes. 

Definition 3.3 (Barb predicates). Let P be a closed process, and x be a free channel name 
in P, 

(1) P has a strong barb on channel x: P ix, iff P = def D in Q ^ x{e), for some D, Q and 
e, where x clv[P>]. 

(2) P has a weak barb on channel x: Pij-x, iff P — >* P', such that P' Ix- 
where — >* denotes the reflexive and transitive closure of — k 

Following the definition, it is easy to check that two structurally congruent processes 
maintain the same barbs, i.e. the lemma below. 

Lemma 3.4. For two closed processes P dTid wheTicvcT P = C^^ wc have P \,x ^ff Q l-x? 
and Pij-x iff Q^x- 

Proof. The part for strong barb holds following the transitivity of =, and the part for weak 
barb holds following Lemma 13.21 □ 



(1) P = Q iff h P ^* h Q 



(2) P — ^ Q iff h P ^* — h Q 



Subsequently, we have the following structural rule: 
Lemma 3.2. If P — >Q, P = P' , and Q = Q' , then P' 
Proof. Trivially follow the definitions of = and — and 




□ 
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Definition 3.5 (Weak barbed bisimulation) . A binary relation TZ on closed processes is a 
weak barbed bisimulation, iff whenever P TZ Q, we have: 

(1) If P — >* P', then 3Q', such that Q — >* Q' and P' TZ Q' , and vice versa. (7^ is a 
reduction bisimulation.) 

(2) Pij-x implies Qij-x for any channel x, and vice versa. {TZ preserves barbs.) 

To make the definition easier to work with, we prove the following lemma where P — >* 
P' is replaced by P — > P' in the first clause, and P \yx is replaced by P [x in the second 
clause. 

Lemma 3.6. Let TZ be a binary relation on closed processes that satisfies the following two 
conditions for any processes P and Q such that P TZ Q: 

(1) If P — > P' , then 3Q' , such that Q — Q' and P' TZ Q' , and vice versa. 

(2) P Ix implies QJJ-x for any channel x, and vice versa. 
Then TZ is a weak barbed bisimulation. 

Proof. We check against the two clauses of Definition 13.51 for one direction. The proof of 
the other direction is symmetric. 

(1) 7^ is a reduction bisimulation, that is 

P — >* p' =^ 3Q\ s.t. Q — >* Q' and P' TZ Q' 
We reason on the length of the derivation P — >* P', written n. 

Base case, n = 0, 1, trivial. 

Induction case. As illustrated in the following diagram chase, 

P TZ Q 

Pi n Qi 



p' n Q' 

we have P — ^""^ Pi — > P' . By induction hypothesis, we have 3Qi, s.t. Q 
Qi and PiTZ Qi. By applying hypothesis (1) to to Pi and Qi, we also have 3Q', s.t 
Q' and P' TZ Q'. And we conclude. 
(2) TZ preserves barbs, that is Pij-x =^ QJJ-x- We thus assume Pij-x- That is, 

3P',P^* P' and P'ix 

By (1) above, 

3Q', Q — >* Q' and P' TZ Q' 
Then by applying hypothesis (2) to P' and Q' , we get Q' ^x- Hence we have Q^x 
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In later discussion, we sometimes directly check against the two conditions of Lemma l3.6l 
instead of the ones of Definition 13.51 for weak barbed bisimulation. 

We define a context as a term built by the grammar of process with a single process 
placeholder [•]. An evaluation context E[-] is a context in which the placeholder is not 
guarded. Namely: 



In addition to evaluation contexts, there are two kinds of guarded contexts, referred to 
as definition contexts {i.e. def J \> [■] or D in P) and pattern matching contexts {i.e. 
match e with . . . | vr^. ^ [•] | . . .). We say that a context is closed if all the free variables 
in it are of channel types. 

Definition 3.7 (Weak barbed congruence). A binary relation on closed processes is a weak 
barbed congruence, iff it is a weak barbed bisimulation and closed by application of any 
closed evaluation context. We denote the largest weak barbed congruence as ~. 

The weak barbed congruence ~ is defined on the closed subset of the applied join 
calculus. Although the definition itself only requires the closure of evaluation contexts, 
it can be proved that the full congruence does not provide more discriminative power. 
Similarly to what Fournet has established for the pure join calculus in his thesis we 
first have the property that ~ is closed by substitution because, roughly, name substitutions 
may be mimicked by evaluation contexts with "forwarders". 

Lemma 3.8. Given two closed processes P and Q, if P k, Q, then for any substitution a, 
Pa ~ Qa. (Note that "closed" stands for "variable-closed" . ) 

Proof. The main idea is to build an evaluation context E[-] whose task is to forward messages 
from names to names according to the substitution a, and to prove the equivalences Pa ~ 
E[P] and Qa ss E[Q]. Because f« is closed by evaluation contexts, we also have P ^ Q =^ 
E[P] ~ Then we conclude by the transitivity of ~. Refer to the proof of Fournet 

in |15| Lemma 4.14 of Chapter 4] for details. □ 

Then based on this property, the full congruence is also guaranteed considering the fact 
that the essence of a guarded context is substitution. 

Theorem 3.9. Weak barbed congruence ~ is closed by application of any closed context. 

Proof. Corollary of Theorem 13.191 that we prove later on. □ 

Up to now, we have defined the weak barbed congruence to express the equivalence of 
two closed processes. However, our purpose is to study the correctness of a static transfor- 
mation. Since static transformations apply perfectly well to processes with free variables 
of non-channel type, restricting ourselves to the world of closed processes is not an option. 
In the next section, we will derive an equivalence relation for open processes. But before 
getting into the definition, let us first establish some up-to techniques on the closed sub-set 
of the calculus. Such up-to techniques will be used during the courses of proving upcoming 
lemmas and theorems. 

Definition 3.10 (Weak barbed congruence up to =). A binary relation TZ on closed pro- 
cesses is a weak barbed congruence up to =, iS P TZ Q implies: 

(1) for any closed evaluation context E[-], E[P] =TZ= E[Q] {TZ is closed under evaluation 
contexts up to =); 




def 



[•] I E[-]^P I PkE[-] I defL»in^;[-] 
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(2) whenever P — >* P', 3Q' , such that Q — >* Q' and P' =Tl= Q' , and vice versa (7^ is a 
reduction bisimulation up to =); 

(3) Pij-x imphes Qij-x for any channel x, and vice versa. (JZ preserves barbs.) 

As we did for plain weak barbed bisimulation (Definition 13. 5j) in Lemma 13.61 we intro- 
duce the following weakened conditions for checking weak barbed congruence up to =. 

Lemma 3.11. Let TZ be a binary relation on closed processes and TZ that satisfies the 
following three conditions for any processes P and Q such that P TZ Q: 

(1) for any closed evaluation context E[-], E[P] =1Z= E[Q]; 

(2) If P — > P' , then 3Q' , such that Q — >* Q' and P' =1Z= Q' , and vice versa. 

(3) P ix implies Q\\-x for any channel x, and vice versa. 
Then TZ is a weak barbed congruence up to =. 

Proof. We check against the three clauses of Definition 13.101 

(1) The first clause is the same as clause (1) of Definition 13. 101 

(2) We show: 

P — y* p' =^ 3Q', s.t. Q — >* Q' and P' =TZ= Q' 
We reason on the length of the derivation P — >* P', written n. 

Base case, n = 0, 1, trivial. 

Induction case. As illustrated in the following diagram chase, 

p n Q 

vn-l V* 

Pi = P2 n Q2 = Ql 



p' = n = Q' 

we have P — >^~^ Pi — > P' . By induction hypothesis, we have 3(5i, (52) -fbj s.t. 
Q — >* Ql and Pi = P2TZQ2 = Qi. Following LemmaEH we have P2 — > P' , too. By 
applying hypothesis (2) to P2 and Q2, we also have 3Q', s.t. Q2 — >* Q' and P' =TZ= 
Q'. Then by Lemma 13.21 again, we have Qi — >* Q' , too. To conclude, we have 
3Q', s.t. Q — >* Q' and P' =TZ= Q'. 

The proof of the other direction is symmetric. 
(3) We show: 

P JJ-x ' ' Q 

We thus assume Pi^x- 

3Pi, s.t. P^* Pi and Pi L 

By (2) above, we get: 

3Qi,Q2, P2, s.t. Q Ql and Pi = P2 ^ Q2 = Qi 

By Lemma EiH we have P2 ix- Applying hypothesis (3) to P2 and Q2, we get Q2ij-x- 
Then by Lemma l3 . 41 again . we have Qi-IJ-^- To conclude, we have Q — >* Qi and Qiij-x, 
i.e. Qij-x- The proof of the other direction is symmetric. 
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Lemma 3.12. IfTZisa weak barbed congruence up to =, then TZQ^. 

Proof. We first show =TZ= C w, i.e. =TZ= is a weak barbed congruence. 

(1) =TZ= is closed under evaluation contexts. Given P =TZ= Q, there exist Pi and Qi such 
that P = Pi TZ Qi = Q. Let us name two properties: 

(a) = is closed under evaluation contexts; 

(b) clause (1) of Definition I3.1U1 

Then, for any closed evaluation context E[-], we have: 

E[P] i E[Pi]^n^E[Qi] i E[Q] 
By transitivity of =, we conclude: 

E[P] =7^= E[Q] 

(2) =TZ= is a reduction bisimulation. We use clause (2) of Definition I3.1UI and then 
Lemma 13.21 to reason by diagram chase as follows: 

p = Pi n Qi = Q p = Pi n Qi = Q 



n = 




= n = 





(3) =TZ= preserves barbs. Given P =TZ= Q, we have P 
statement, 

De/IUniO) 



Pi TZ Qi = Q, and the following 



„ I Lemma 
-r ix =^ Pi ix 



„ 1 1 Lemma 
VI Vvx 



and vice versa. 



Then because TZ C =TZ= C ss, we conclude that TZ C 



□ 



A standard proof technique is then to consider weak barbed congruence up to ~. How- 
ever, as demonstrated in such a technique does not work in general in weak settings. 
Thus, we instead define another relation, where up to ~ is performed on one side only. This 
new relation is sound, as shown by the forthcoming Lemma 13.151 

Definition 3.13 (Weak barbed congruence up to /d 0). A binary relation TZ on closed 
processes is a weak barbed congruence up to /c/, iff P 7^ Q implies: 

(1) for any closed evaluation context E[-], E[P] =TZ= E[Q] {TZ is closed under evaluation 
contexts up to =); 

(2) whenever P — >* P', 3Q' , such that Q — >* Q' and P' 7^« Q'; 

(3) whenever Q — >* Q', 3P' , such that P — >* P' and P' ^TZ Q'; 

(The two clause above say that P is a reduction bisimulation up to Id.) 

(4) Pij-x implies Qij-x for any channel x, and vice versa. {TZ preserves barbs.) 

Again, we first derive the following alternative conditions for checking weak barbed 
congruence up to Id. 



"^Id stands for the identity relation on closed processes. Note that this relation is derived from "bisimula- 
tion up to almost- weak bisimulation" in |41|. because Id is included in almost- weak bisimulation, with some 
adjustments to the barbed setting. 
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Lemma 3.14. TZ is a binary relation on closed processes and TZ satisfies the following 
conditions for any processes P and Q such that P TZ Q: 

(1) for any closed evaluation context E[], E\P\ =TZ= E[Q]; 

(2) whenever P — > P' , 3Q' , such that Q — >* Q' and P' IZ^ Q' ; 

(3) whenever Q — > Q' , 3P' , such that P — >* P' and P' ^TZ Q' ; 

(4) P Ix implies Qij-x for any channel x, and vice versa. 
Then TZ is a weak barbed congruence up to Id. 

Proof. We check against the clauses of Definition 13.131 

(1) The first clause is the same. 

(2) We show: 

P — ,* p' =^ 3Q', s.t. Q — >* Q' and P' TZ^ Q' 
We reason on the length of the derivation P — >* P' , written n. 

Base case, n = 0, 1, trivial. 

Induction case. As illustrated in the following diagram chase, 

P n Q 

vn-l V* 

Pi n Q2 ^ Qi 



P' TZ a Qs ^ Q' 

we have P — Pi — P'. By induction hypothesis, we get 3Qi, such that 
Q — >* Qi and Pi TZfv Qi. That is, 3(52, such that Pi TZ Q2 Qi- By applying 
hypothesis (2) to Pi and Q2, we have BQs such that Q2 — >* Q3 and P' TZ^ Q3. 
Because Q2 ^ Qi, we also have 3Q' such that Qi — >* Q' and Q3 ^ Q' — remember 
that ~ is the largest weak barbed congruence and thus a reduction bisimulation. 
We conclude by transitivity of ~. 

(3) Symmetric of (2) above. 

(4) We show: 

P -IJ-x Q -U-a; 

We thus assume Pij-x- That is, we have: 

3Pi s.t P — >* Pi and Pi ix 

By (2) above, we get: 

3Qi, Q2, s.t. Q — >* Qi and Pi 7^ Q2 ~ Qi 

Applying hypothesis (4) to Pi and Q2, we get Q2^x- Applying clause (2) of Defi- 
nition to Q2 and Qi, we then get Qiij-x- To conclude, we have Q — >* Qi and 
Qiij-x, ^-e. Qi\-x- The proof of the other direction is symmetric. 

□ 
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Lemma 3.15. IfTZisa weak barbed congruence up to Id, then 7^Cf». 
Proof. We first show k.IZk, C Ri, i.e. ~7^~ is a weak barbed congruence. 



(1) 



(2) 



(a) 

E[P] W E[P^] 



K.'R.K. is closed under evaluation contexts. Given P tuTZth Q, there exist Pi and Qi such 
that P PiTZ Qi Q. Let us name two properties: 

(a) is closed under evaluation contexts; 

(b) clause (1) of Definition I.S.13| 

Then, for any closed evaluation context E[-], we have: 

ib) 

"n^ElQi] fe^ E[Q] 
Because = C fa, we have =1Z= C f»7^f». Hence we have: 

E[P] ^ E[Pi] E[Qi] « E[Q] 

And we conclude, by transitivity of ~. 

~7^~ is a reduction bisimulation. We use clause (1) of Definition 13.51 clause (2) of 
Definition 13.131 clause (1) of Definition 13. 5| and the transitivity of ~, in the proof 
sketched by the following diagram: 

p ^ Pi n Qi ^ Q P ^ Pi v. Qi ^ Q 



p' 
p' 



p{ 

n 



i 

p' 

p' 



Pi Qi 



(3) thIZk. preserves barbs. Given P ^TZ^ Q, we have P ^ Pi TZ Qi ^ Q, and the following 
statement, 

Pif^W'^^ p^^^,-/^(3) Q^^f^fm^) , and vice versa. 
Then because TZ C wT^w C ps, we conclude that TZ C Ki. □ 



3.3.2. Observational equivalence for open processes. The approach we follow here is to lift 
the equivalence relation of closed processes to open processes by closing up by all substitu- 
tions, and we call the resulting relation open equivalence. 

Although both are "open", our open equivalence is unrelated to the open bisimilarity 
of Sangiorgi in [301 ■ We use "open" to name our equivalence relation because it relates 
open terms. By contrast, "open" in open bisimilarity emphasizes a characteristic of the 
bisimulation definition, namely free names are open to equality throughout the bisimulation 
game. From the perspective of where and when to apply name substitutions, for open 
equivalence, we instantiate free names (and variables) only at the beginning before we test 
the resulting (closed) processes for weak barbed congruence. On the contrary, in the case 
of open bisimilarity, such instantiation happens at every co-inductive step. 

Another way to define equivalence relations on open terms could be to adapt the seman- 
tics to symbolic transition system and to define a symbolic barbed congruence like in 
Although the symbolic method is claimed to be easier for analysis and verification, we found 
open equivalence to be lighter and more intuitive. As a matter of fact, it is not uncommon 
to define functions extensionally, i.e. by considering application to all possible arguments. 
Moreover, as can be seen in Section [71 our proofs remain tractable. 
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Definition 3.16 (Open equivalence c= ). Two processes P and Q are open equivalent, 
written P c= Q, iff for any substitution a such that Pa and Qa are closed, we have 
Per f» Qa. 

As a corollary, c= is closed by any substitution. 
Lemma 3.17. P - Q ^ Vct.Pct ^ Qa 

Proof. We assume P ^ Q and let cr be a substitution. We need to prove that Pa ^ Qa. 
That is, we need to prove that, for all closing substitution p, we have: 

iPa)p^{Qa)p 

Thus, we need to prove that, for all closing substitution p, we have: 

P{poa) ^Q{poa) (3.1) 

where o stands for substitution composition, i.e. P{poa) =^ {Pa) p. It remains to observe 
that poa closes both processes P and Q, and to apply the definition of P o Q, before 
concluding that statement (|3.1jl above holds. □ 

We aim at proving that ^ is closed by any contexts (Theorem 18. 191 below) . To prove 
the theorem, we need the following rather unusual lemma, to state the fact that although 
we have introduced "deterministic" reduction into the process calculus by extending it with 
the match construct, this kind of determinism does not impact process equivalence. 

Lemma 3.18. We say a closed process P deterministically reduces to P' , iff for all P" 
such that P — P" , we have P' = P" . For any such pair of closed processes P and P' , we 
have P^P' . 

Proof. Let 7^ be the relation {(def P> in P&Q, def D in P' & Q), (S, 5)} for all closed 
definitions D, closed processes Q and S, and all (P, P') pairs such that P deterministically 
reduces to P' . We prove that 7^ is a weak barbed congruence up to =. 

• By definition, TZ is closed by evaluation contexts up to = (i.e. Lemma l3.11l fl)). 

• We show that TZ preserves barbs (i.e. Lemma I3.11l f3)). We omit the (trivial) discussion 
of pairs of identical processes {S,S) in TZ. We show that (def D in P ^Q)lx =^ (def 
D in P' Sz Q) JJ-a;. We distinguish the cases that make (def D in P ^Q) hold. 

— P ix- Obviously reduction cannot erase a barb (x dv[P)]), i.e. we have P' ix- Hence, 
we have (def D in P' & Q) j^. 

— Qix- Trivial. 

As to the opposite direction i.e. (def D in P' ^Q)ix ==^ (def D in P ^Q) \l-x, '\t holds 
trivially because def D in P ^Q — > def D in P' SzQ- 

• We show 7^ to be a reduction bisimulation up to = (i.e. Lemma I3.11l f2l). We omit the 
trivial case of pairs of identical processes in TZ, that is, we only consider process pairs of 
form: (def P> in P&Q, def P> in P' & Q). 

— If the reduction of the left part is caused by a reduction on Q alone or by the interaction 
between D and Q, yielding def D in P ^ Q', then the right part can perform the same 
reduction step, yielding def D in P' & Q'. The resulting two processes are still in 
relation TZ with Q being Q' . Vice versa. 

— If the reduction of the left part is caused by a reduction on P alone, then, because P 
deterministically reduces to P', the resulting process is def D in P'&Q (up to =). 
Thus, the right part simulates with no reduction and def D in P' SzQ satisfies relation 
TZ with itself. 
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— If the reduction of the left part is caused by the interaction between D and P, then we 
must have P = Pq & where J t> G is a reaction rule in D and the resulting process is 
def D in Pq kGa. Because J a does not reduce by itself and P deterministically 
reduces to P', we have P' = Pq&i Jcr and Pq deterministically reduces to Pq. Therefore, 
the right part simulates by an identical reduction and gives def D in Pq &; Q & Ga. 
The resulting two processes are still in relation TZ with Q being Q ^Ga, P being Pq, 
and P' being Pq. 

— If the reduction of the right part is caused by a reduction on P' itself or by the 
interaction between D and P' , then the left part can always simulate the reduction by 
first reducing def P* in P & Q to def D in P' & Q. 

Following the analysis above, 7^ is a weak barbed congruence up to =. Besides we have 

P = (def TinP&O) 7^ (def TinP'&O) = P' 

Moreover, by the proof of Lemma l3.121 relation =TZ= is a weak barbed congruence. Hence 
we conclude P k, P' . □ 

Theorem 3.19. The open equivalence ^ is a full congruence. 

Proof. We demonstrate is closed by 1. evaluation contexts, 2. definition contexts, and 
3. pattern matching contexts. In the proof, we locally use A, B, R, S, T, V, W, X, Y, Z 
to denote various processes. 

1. Closed by evaluation contexts: E[-]. We show: 

P ^ Q ^ E[P] ^ E[Q] 

For any substitution a such that {E[P])a and {E[Q])a are closed, we need to prove {E\P])(t ~ 
{E[Q])a. We write {E[P])a as Ea[P(Ji] and {E[Q])a as Ea[Qai], where Ea[-], Pai, Qai 
are closed and ai is a minus the (possible) bindings for the channel names bound by E in 
[•]. By hypothesis P o Q, we have Pai Qai. Then, Ea[-] being a closed evaluation 
context, we conclude, by definition of ~. 

2. Closed by definition contexts: def J > [•] or D in R. We show: 

P - Q ^ (def J>P or DinR) <^ (def J > Q or D in R) 

For any substitution a such that (def J > P or P) in P)(7 and (def J t> Q or P) in P)(T are 
closed, we need to prove: 

(def J>P or Din R)a w (def J > Q or D in R)a 

namely, 

def J > Pai or Da2 in Ra2 ~ def J > Qai or Da2 in Ra2 (3-1) 

where (T2 is a minus the (possible) bindings for the channel names defined in Jt>P or D (i.e. 
dv[J>P or P]), and ai is o"2 minus the (possible) bindings for the variables of rv[J]. Notice 
that, by contrast with the subcomponents Da2 and Ra2 that are closed, the processes Pai 
and Qai may not be closed, since some of the variables in rv[J] may be of an algebraic 
type. Nevertheless, by hypothesis P o Q and Lemma l3.17| we have Pai ^ Qai. 
Then, we build the following relation TZ on closed processes: 

7^= {(def J > S or D in A,de{ J >T or D in B) \ S <^ T and A ^ B}. 
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We analyze the following three aspects of TZ: closure by closed evaluation contexts; preserv- 
ing barbs; and reduction bisimulation. 

• 7?. is closed by closed evaluation contexts up to = {i.e. Lemma l3.14l fl)l. For any closed 
E[-], with necessary a-conversions left implicit, we have: 

^[def Jt> S or Din A] =def J>S or {D or D') in {A & K) 

E[dei J>Tor DinB]=def J[>Tor {D or D') in {B & K) 

where A^ K B ^ K , because w is preserved by the closed evaluation context [•] &-fC. 

• TZ preserves barbs (i.e. Lemma I3.14l f4)l. We write I?[X,y] for the closed process def 
J > X or D inY. Since 7^ is a symmetric relation, we only need to prove: 

V[S,A]i^ =^ V[T,B] ij.. 

Because V[S, A] implies Ai^ and x (dv[J] U dy[D]), we also have P[T, A] U. More- 
over, because V[T, ■] is a closed evaluation context, and by hypothesis A ^ B, we have 

V[T,A]^V[T,B] (3.2) 

By clause (2) of Definition 13.51 we finally get T>[T, B]ij-x- 

• 7^ is a reduction bisimulation up to Id (i.e. Lemma l3. 141 f2) and (3)). We first prove the 
following statement. For any two I'[5', A] and ^[T, ^4], we have: 

If V[S, A] — > W, then V[T, A] — > V, and WTZV. (3.3) 

There are three subcases, depending on the nature of the reduction to W . 

(1) A — > A' imdW = V[S,A']. Then V[T,A] — > V[T,A'], with obviously V[S,A'] TZ 
V[T,A'], since A' ^ A' . 

(2) A = AohJr] an.dW = V[S, Aq & Srj]. Then V[T, A] — > V[T, Aq & Tr]]. Notice that 
Sr] and Trj are closed. Then, from S c= T, we get Sr] ^ Trj, and thus Aq^St] f» 
^0 & Ti]. That is, we get V[S, Aq & Srj] TZ V[T, Aq & Trj] . 

(3) A = AotJiVi, D has form ...or Ji > Pi or..., and W = V[S, Aq ^ Pirji]. Then 
V[T, A] — > T^[T, Aq & Pir]i]. And we conclude, as we did in case 1 above. 

Moreover, from equivalence 1)3.2(1 and since ~ is a bisimulation, we have: 

If V[T, A] — > V, then 3V' s.t. V[T, B] — >* V, and V ^ V . (3.4) 
Combining both statements (|3.3j) and (|3.4|) . we get: 

If V[S, A] — > W, then s.t. P[r, B\ — V , and WTZ^V. (3.5) 
The proof of the other direction is by symmetry. 

Following the analysis above, 7^ is a weak barbed congruence up to /d, hence by 
Lemma 13.151 7^ C ss. Obviously, the two processes of statement (|3.1|) are related by TZ. 
Therefore, ()3.1|) holds. In other words, o is closed by any definition context. 
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3. Closed by pattern matching contexts: match e with ... | vr^ ^ [•][.. .. We show: 
P - Q ^ 

(match e with . . . | vr^ — > P | . . .) <^ (match e with ... \ irt ^ Q \ • • •) (3.6) 

To establish the right part, we need to show: 

(match e with ... | 7rfc — > P | . . .)a ~ (match e with ... | vr;^- — > Q I • • •)'^ 

for ah a, s.t. (match e with . . . | tt/j — > P | . . .)(t and (match e with . . . | vr^ ^ Q | . . .)a 
are closed. Namely, 

match ecj with ... | vr^ ^ Pfj^ | . . . ~ match ea with ... | vr^ — > Qa^ \ ■ ■ ■ (3.7) 

where is a minus the (possible) bindings for the variables of rv[7rfe]. Notice that ea is 
closed, while Pak and Qak may not be. 

By the semantics of ML pattern matching, match ea with ... | vr^ — > Pa^ \ ... 
deterministically reduces to either P{rjk o ak) or Rirji, depending on the value of ea. Process 
Ri is the ith guarded process (i ^ A;) in this pattern matching, r]k and i]i stand for the 
substitutions that originate from algebraic matching. Notice that P(% o a^) and RiJji now 
are closed processes. We have the similar statement for match ea with ... | tt^ — >■ Qa^ | . . .. 
Therefore, by Lemma 13.181 we have either: 

match ea with ... | vr/^. — > Pa^ | . . . ~ P(% o Cfc) (3-8) 

match ea with ... | vr^ ^ Qcr^ | . . . ~ Q{7]k o Cfc) (3-9) 

or we have: 

match ea with ... | vr^ — > Pak | . . . ~ Ri'r]i (3.10) 

match ea with ... | vr^ ^ Qak | . . . ~ Ri'rji (3-11) 

Obviously we have Riiji » Ri'i]i- Moreover, since P ^ Q, we get P{r]k°f^k) ~ Q{i]k°'^k)- 
Then, by the transitivity of « and, either by (|3.8() - (|3.9() . or by H3.10() - (|3.11() . we conclude 
that the statement (|3.7() holds. 

Additionally, in the case where ea matches none of the patterns in ()3.7() , both processes 
are blocked and are to the null process 0. □ 

There is still a good property worth noticing: for the closed subset of the applied join- 
calculus, the equivalences =c^ and ~ coincide. This is straightforward by the definition 
of =c^ and by Lemma 13.81 Then, Theorem 13.91 follows as a corollary. 



4. Transforming pattern arguments into ML pattern matching 

The extension of the join calculus that we have presented up to now remains quite 
simple, in particular as regards chemical semantics. However, an efficient implementation 
is more involved. Our approach is to first transform the extended join definitions into 
ordinary ones plus ML pattern matching, then reuse the existing implementation of join. 
In this section, we explain informally the key ideas of the transformation. 

The extended join-pattern matching in applied join requires to test message contents 
against pattern arguments, while the ordinary join-pattern matching in join is only capa- 
ble of testing message presence. Our idea is to separate algebraic pattern testing from 
join-pattern synchronization, and to perform the former operation by using ML pattern 
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matching. To avoid inappropriate message consumption, message contents are tested first. 
Let us consider the following join definition where channel x has two pattern arguments: 

def x{ni) & . •) t" Pi 
or x(7r2) & ?/2(. . .) > -P2 
We refine channel x into more precise ones, each of which carries the instances of patterns vri 
or 7r2: 

def x^^{...) k yi{...) Pi 
or a;^2(. . .) & y2(- • •) » -^2 
Then, we add a new reaction rule to dispatch the messages on channel x to either 
or x{z) t> match z with 

I TTi Xt,^{...) 
I 7r2 —> XttzI---) 

1-^0 

Note that the null process is used in the last matching rule to discard messages that match 
neither tti nor 7r2. 

The simple compilation above works perfectly, as long as tti and 112 are incompatible. 
Unfortunately, it falls short when vri and 112 have common instances. Consider the situation 
where there is a message pending on channel ^2) none on y^, and also a message u on x 
where f is a common instance of patterns vri and 712- Then, following the first match policy, 
the deterministic ML pattern matching can only dispatch x{v) to the refined channel a^i- 
As a result, the guarded process P2 is not triggered, whereas it could have been0 To tackle 
this problem, further refinements are called for according to the following cases. 

• If vTi ^ 7r2, (but 112 'fi 71"! ), that is if all instances of 1^2 are instances of vri, then, to get a 
chance of meeting its instances, pattern 7r2 must come first: 

or x{^z) match z with 

I Vr2 Xvral"-) 

I vTi ^ a;^i(...) 
1-^0 

But now, channel does not carry all the possible instances of pattern vri any more, 
instances shared by pattern vr2 are dispatched to Xt^^. As a consequence, the actual 
transformation of the initial reaction rules is as follows: 

def x^^{. . .) & 2/i(. . .) Pi 

or . .) & 2/i(. . .) > Pi 

or . .) & 2/2(- • •) t> ^2 
Observe that nondeterminism is now more explicit: an instance of vr2 sent on channel x 
can be consumed by either the second or the third reaction rule to trigger either Pi or 
P2. We can shorten the new definition a little by using or in join patterns: 

def . .) or x^^{. . .)) & yx{. . .) Pi 

or XttsC- • •) & y2(- • •) ^2 
Here the disjunctive composition ( Ji or J2) in join patterns works as syntactic sugar, in 
the following sense: 

J&(Ji or J2) > P = (J& Ji > P) or (J& J2 > P) 

• If vri = vr2, then matching by their representative is enough: 



'Given our implementation "limited fairness guarantee" , it can be argued that P2 should be triggered. 
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def a;^j|^2(. . .) & yi{. . .) > Pi 
or x^^i^^{...) & y2{---) P2 
or x{z) match z with 

I VTi |7r2 ^ ^nilTZii- ■ •) 

1-^0 

• Finally, if neither vri ^ 7r2 nor 7r2 vri holds, with tti and 7r2 being nevertheless compat- 
ible, then an extra matching by pattern tti | tt2 is needed: 

def {x^-,{. . .) or 2;^j|^2(. . .)) & yi(- • •) t> A 
or {xn^i- ■ •) or a;^it,r2(- ■ ■)) & 2/2(- ■ ■) > P2 
or ^(z) l> match z with 

I VTi t 7r2 ^ l^ral- • •) 

I 71"! ^ X^^{. ..) \ TT2 ^ X^2(. . .) 

1-^0 

Note that the relative order of vri and 7r2 is irrelevant here. 
In the transformation rules above, we paid little attention to variables in patterns, 
by writing X7r(- • •)• We now show variable management by means of the concurrent stack 
example. Here, the relevant patterns are vri = Is and tt2 = x: :xs and we are in the case 
where vri ^ ^^2 (and tt2 ~fi vri because of instance empty list [] ). Our idea is to let dispatching 
focus on instance checking, and to perform variable binding after synchronization: 

def pop{r) & State x::xs{z) > match z with x: :xs ^ r{x) &: State{xs) 
or push{v) &: [State x::xs{z) or Stateis{z)) > match z with Is —* State{v: -.Is) 
or State{z) > match z with 

I _: :_ ^ State x::xs{z) 

I _ — > Stateis{z) 

One may believe that the matching of the pattern x: :xs needs to be performed twice (once 
in the dispatcher, once in the first reaction rule), but it is not necessary. The compiler 
should know that the matching of z against x: :xs in the first reaction rule cannot fail, 
and as a consequence, no test needs to be performed here, only the binding of the pattern 
variables. See Section 15^ for details. 

5. The compilation {-j 

We formalize the intuitive idea described in Section 0] as a transformer Yx, which trans- 
forms a join definition D with respect to channel x. The algorithm essentially works by 
constructing the meet semi-lattice of the formal pattern arguments of channel x in D, 
modulo pattern equivalence =, with the less precise relation ^ being the partial order. 
Moreover, we visualize the lattice as a Directed Acyclic Graph (DAG), namely, vertices as 
patterns, and edges representing the partial order. If we reason more on instance sets than 
on patterns, this structure is quite close to the "subset graph" of ^Hl ■ 

Algorithm Y^: Given D, a join definition, where 2; is a channel defined by D. 
Step 0: Preprocess. 

(1) Collect all the pattern arguments of channel x into the sequence: 

X = , 7r2 , . . . , 7r„ 

(2) Let 11^ be formed from lix by replacing all variables by wildcards "_" and taking the 
I of all equivalent patterns; thus 11^ is a sequence of pairwise nonequivalent patterns. 
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(3) Perform exhaustiveness check on 11^., if not exhaustive, issue a warning. 

(4) IF: There is only one pattern in 11^, and that 11^ is exhaustive 
THEN: goto Step 5. (In that case, no dispatching is needed.) 

Step 1: Closure by least upper bound. 

For any pattern vr and pattern sequence IT = tti; 7r2; . . . ; vr^, we define vr 1 11 as the sequence 
vr I VTjj ; vr I TTjj ; . . . ; vr I VTj^ , where the VTj^ s are the patterns from IT that are compatible 
with vr. 

We also define function F, which takes a pattern sequence 11 as argument and returns 
a pattern sequence. 
IF: n is empty 
THEN: F(n) = U 

ELSE: Decompose n as vr; W and state F(n) = vr; F(n'); vr j F{U') 
Compute the sequence 0' = F(Il'^). It is worth noticing that Q' is the sequence of all 
valid patterns (vr? | . . . (irf^ ^ | irf^ )•••)> with 1 < ii < i2 < ■ ■ ■ < ik 1^ n, and 1 < k < n, 

where we decompose H'^ as vrf'; vrf'; . . . ; vr^'. 
Step 2: Up to equivalence. 

As in Step 0.2, build by taking the ^ of all equivalent patterns in 17'. 
Step 3: Build DAG: 

Corresponding to the semi-lattice (0, ^), build a directed acyclic graph G{V,£). 

(1) v = 0,^ = 0. 

(2) For each pattern uj in 17, add a new vertex v into V and annotate the vertex with uj. 

(3) V(f , v') G V X V,v ^ v', with annotations to and lo' respectively, if u; ^ iv', then add 
an edge from v' to v into £. 

Step 4: Add dispatcher. 

Following one topological order, the vertices of G are indexed as vi, . . . ,Vm, correspond- 
ingly with annotations uji, . . . , iOm- We extend the join definition D with a dispatcher on 
channel x of the form: x{z) match z with A, where z is a fresh variable and A is built 
as follows: 

(1) Let j ranges over {1, . . . , m}. Following the topological order above, for all vertices vj 
in V append a rule "| ujj — > (z)" to A, where x^j^ is a fresh channel name assigned 
to vertex vj whose annotation is ujj. Such fresh channels are here for the purpose of 
carrying messages originally sent to x then forwarded by the dispatcher, hence are 
also referred to as forwarding channels. 

(2) If Tlx is not exhaustive, then add a rule "| _ — > 0" at the end. 
Step 5: Rewrite reaction rules. 

For each reaction rule defining channel x in D: JiSz x{'Kf) >Qi, we rewrite it according to 
the following policy. Let Q'- = match Zj with vrf Qi, where Zi is a fresh variable. 

IF: coming from Step 

THEN: rewrite to Jj & x{z.i) > Q[ 

ELSE: 

(1) Let be the unique vertex in V, s.t. its annotation cjj. = vr?. 

(2) We collect all the predecessors of Uj. in G, and we record the indices of them, together 
with jj, into a set that we note /(vrf). 

(3) Rewrite to JiSz(\/ x^.(zi)) > Q', where \/ is the generalized or construct of 
join patterns. 
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Given a join definition D, we note dv[Z)] = {xi, . . . ,x„} (n > 0), that is we order the 
channel names arbitrarily. To transform D, we apply Y^^ . . . 1^.^(-D). And the compilation 
of processes [-1 is inductively defined as follows: 

[01 = 

[A&P2I = mkm 

[def Din PI def y,„...y,,(p])in[Pl 

[match e vi^ithp'^-'^ tt^ Pi} =^ match e vi^ithl*^-'^ vTj [Pj] 

[TI T 

pi or D2j = pil or 

Observe that the compilation preserves the interface of join definitions. Namely, it only 
affects the join definitions, never suppressing a channel, while message sending remains the 
same. 



6. Example of compilation 

Given the following join definition for an enriched integer stack: 

def push{v) & State{ls) l> State (v. :ls) 

or pop{r) & State{x: :xs) > r{x) & State{xs) 

or insert{n) &; State{0: :xs) > State{0: :n: :xs) 

or last{r) & State{\.x']) > r{x) & State{\.x']) 

or swapO & State{xi: :x2 - - xs) > State{x2- -xi: :xs) 

or pause{r) &: State^H) > r() 

or resume{r) t> State{\.'\) k. ri) 
The insert channel inserts an integer as the second topmost element, but only when the 
topmost element is 0. The last channel gives back the last element in the stack, keeping 
the stack unchanged. The swap channel exchange the topmost two elements in the stack. 
The pause channel temporarily freezes the stack when it is empty, while the resume channel 
brings the stack back into work. We now demonstrate our transformation with respect to 
channel State. 

Step 0: We collect the pattern arguments of channel State into Instate- 

Estate = Is; x: -.xs; : : xs; [x] ; xi : : 3:2 : : xs; [] ; [] 

We drop the last equivalent [] pattern during the up to equivalence substep 0.2, and we 
get: 

^'state — ^s; x: -.xs; : : xs; [x] ; xi : : X2 : : xs; [] 
Additionally, ^'state exhaustive (pattern Is alone covers all possibilities). Note that in 
the demonstration of this example, we sometimes keep variable names in patterns for 
readers' convenience. They are not necessary and are actually all replaced by "_" in the 
implementation. 
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Step 1,2: extends ^'state with all possible least upper bounds. Then we form O from 
$7' by taking the J of all equivalent patterns. 

$7 = /s; x::xs; 0::xs; [a;]; xi : : X2 : : xs; [] ; : : X2 : : xs; [0] 

Note that the last two patterns are new, where: 

: : X2 : : xs = : : xs f xi : : X2 : : xs 
[0] = : : xs T [x] 

Step 3: We build the semi-lattice (17, ^), see Figure^ 

: : X2 ■■ : xs [0] 




1. 


: : 2:2 : : M 


2. 


[0] 


3. 


: : 2:2 : : 2;s 


4. 


0: -.xs 


5. 


Lxl 


6. 


x: -.xs 


7. 


[] 


8. 


Is 



Figure 4: The semi-lattice of patterns and the topological order 

Step 4: One possible topological order of the vertices is also given at the right of FigurelH 
Following that order, we build the dispatcher on channel State. 

or State{z) \> match z with 

I 0: :_: :_ ^ Statei{z) 
I [0] State2{z) 
I States{z) 
I : :_ ^ Statei{z) 
I [_] — State^{z) 
I _: :_ ^ StateQ{z) 
I [] — > Statei{z) 
I _ —5- States{z) 
where Statei, . . ., States are the fresh forwarding channels. 

Step 5: We rewrite the original reaction rules. As an example, consider the third reaction 
rule for the insert behavior: the pattern in State{0: :xs) corresponds to vertex 4 with 
annotation : : xs in the graph, which has two predecessors: vertex 1 with annotation 
: : X2 : : xs and vertex 2 with annotation [0] . Therefore, the reaction rule is rewritten to: 
insert{n) & {Statei{z3) or State2{z^) or Statei{z-i)) 

l> match 23 with 0: :xs — > State{0: :n: :xs) 
where 23 is a fresh variable. 
As a final result of our transformation, we get the disjunction of the following rules and of 
the dispatcher built in Step 4. 

def push{v) & {Statei{zi) or ... or States{zi)) 

match zi with Is State (v : : Is) 
or pop{r) & {Statei{z2) or ... or State%{z2)) 

> match Z2 with x: :xs — > r(x) &: State{xs) 
or insert{n) & (Stateilz^) or State2{z^) or State/i{z^)) 
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> match Z3 with : : xs ^ State{0 : : n : : xs) 
or last{r) & {State2{zi) or 5^^65(2:4)) 

t> match 24 with [x] r(x) & State{[x']) 
or swap{) & {Statei{z^) or State^{z^)) 

> match Z5 with xi : : X2 : : — s- State{x2 : :x\: : xs) 
or pause{r) & Statej{zQ) > match zg with [] ^ r() 

or resume{r) t> State{\.'\) & r() 

7. Correctness 

A program written in the applied join calculus of Section IHl is a process P. The com- 
pilation JP] replaces all the join definitions D in P by 1^^ . . . Y^^ (D), where dv[D] = 
{xi, . . . , x„}. To guarantee the correctness, we require the programs before and after the 
compilation be open equivalent. Namely, the following theorem should hold. 

Theorem 7.1. For any process P , JP] =^ P. 

Proof. By structural induction on processes. Because o is a full congruence and a tran- 
sitive relation, it suffices to prove one step of the compilation, that is, is correct (see 
Lemma W7I\ below) . □ 

Lemma 7.2. For any join definition D, channel name x G dv[D], and process P, we have: 

def D'mP ^ def Y^{D) in P 

This lemma is crucial to the correctness of the compilation. We elaborate the proof in 
the coming sections. First, we recall the notations of algorithm in Section 17.11 Then, 
we discuss the properties of the dispatcher built by Y^ in Section 17.21 Finally, we prove 
Lemma l7. 21 in Section f7. 31 

7.1. Summary of notations. We summarize the connection between the input and the 
output of Yx- For simplicity, we omit the x superscripts everywhere. According to the 
algorithm given in Section [3 there are two cases during the procedure of Yx , chosen at the 
end of Step 0: 

Case "jump". The case where Steps 1 to 4 are skipped. Then, for any reaction rule of 
the form Jj & x(7rj) > Qi of D, i = 1 . . . n, the pattern vTj is irrefutable, namely, vTj = _. And 
in Yx{D), we have the corresponding reaction rule Ji^x{zi) > match Zj with vTj — > Qi, 
where Zi is fresh. 

Case "go through". The general case. We recall the notations of the DAG G{V,£) built 
by the algorithm. G has m vertices, and following the topological order, the vertices are 
indexed as vi, . . . ,Vm with pattern annotations loi, . . . , uJm- Each vertex is also assigned a 
fresh forwarding channel, written x^^ .. 

For any reaction rule of the form Jj & x(7rj) > Qi oi D, i = 1 . . . n, there exists a unique 
vertex in G called Vj., such that its annotation Wj. = TTj. We use /(vTj) to record the indices 
of the predecessors of Vj^ as well as ji. Note that we have tTj ^ ujj iff j € /(vTj). In Yx{D), we 

have a corresponding reaction rule as Jj &( \J Xi^.{zi)) t> match Zi with iTi ^ Qi, where 
the variable Zi is fresh. Moreover, we add a dispatcher on channel x into Yx{D) as: 
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x{z) match z with 

I ^ x^^{z) 

I UJ,n X^miz) 

I _ — i* if non— exhaustive *) 

where z is a fresh variable. 

7.2. Property of the dispatcher. We go on to discuss the property of the dispatcher buih 
during the transformation on channel x. Let u range over closed expressions, that is over 
values. Modulo pattern equivalence =, the patterns of the dispatcher {LOj,j = l,...,m) 
are all the least upper bounds of the pattern arguments of channel x in the original D 
{TTi,i = 1, . . . ,n). Thus, the vrjS and the ujjS admit the same instances: Ui<i<n l"s(7rj) = 
Ui<j<m Ins(wj). As an immediate consequence, on one hand, for the set of values that do 
not match any of the original vTj s, written H = {ti | Vi,M lns(7rj)}, the values of H do 
not match any tUj either, and those values are silently eaten by the dispatcher. On the 
other hand, given any value u such that there exists at least one vTj with u G lns(7rj), then 
the dispatcher must forward u onto one of the forwarding channels. More precisely, the 
following lemma holds. 

Lemma 7.3. For any value u that is an instance of some original pattern argument iTi, 
the dispatcher forwards u to the forwarding channel assigned to a vertex in G, whose index 
belongs to /(vTj). 

Proof. We thus assume u S lns(7rj). Let K be the set of indices {/c | n E lns(-7rfc)} and 
Hit = {vTfc I k G K}. Let uj be the least upper bound of the patterns in Hk, written tll;^ 
[lo exists, since Hk is non-empty). By steps 1-3 of the compilation algorithm 1^, there 
must exist some vertex denoted by Vjj^ in G with annotation Wj^, = uj. The dispatcher 
forwards message u onto the forwarding channel x^^^^ , for the following two reasons. 

(1) Value u is an instance of Wj^, . 

(2) No pattern of the dispatcher that appears before Wj^, admits u as an instance. Namely, 
any pattern of the dispatcher Wj, 1 < j < m, such that u G \ns{LVj) must be the 
least upper bound of a subset of Hk- Then, since the patterns of the dispatchers are 
ordered topologically (with precision order ^), cojj^ must be the foremost pattern in 
the dispatcher which has u as an instance. Namely, precision order ^ applied to least 
upper bounds is reverse set inclusion applied to instance sets. 

Moreover, because iTi G 11;^ and = = THi^-, we have vr, ^ Wj^. Thus, by definition 
of /(vTj), we have Jk G I{iTi). □ 

In the following, given some value u, we write x^ for the forwarding channel to which 
u is sent by the dispatcher. Using the new notation. Lemma 17.31 is reformulated as follows: 
if n G lns(7rj), then x^ exists and we have Xu G {x^. \ j G /(vTj)}. 

7.3. Proof of Lemma 17.21 

Proof. Following the definition of o , we should prove (def D in P)a ~ (def Yx{D) in P)a, 
for any closing substitution a. In other words, since Yx{D)a = Yx{Da), we should prove: 

def Da in Pai def Yx{Da) in Pai (7.1) 
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where cJi is a minus the (possible) bindings of the variables of clv[D]. Notice that all 
subcomponents Da, Yx{Da) and Pai are closed. Hence, to prove ()7.1() . it suffices to prove 
that Yx is correct for closed terms (Lemma 17.41 below) . □ 

Lemma 7.4. For any closed join definition D, channel name x G dv[Z)], and closed process 
P , we have: 

def DmPKi def Yx{D) in P 
Proof. There are two subcases. 

Case "go through". We construct the following relation TZ: 

n= {(def D in (P&Q), def y^(D) in (P&Q))} 

Above, process P and definition D range respectively over closed processes and closed 
definitions; while Q and Q are particular. Dissect the structure oi D as: 

D = ... or JiSz x{7Ti) >Qi or .. . 

We define Q and Q to be: 

Q = (JJ^g^a;^,5(vri(5)) & (JJ^^^match tt,^^^ with vTj ^ Qiipj^) 

We note the generalized parallel composition. Note that processes Q and Q are (im- 
plicitly) parameterized by the multisets of substitutions A and ^, and by the multiset of 
values U. In the definition of TZ, A, ^ and U range over all appropriate multisets. More 
precisely, given any reaction rule Ji^x{iTi) > Qi from D, we note 5 any (closed) substitu- 
tion on domain rv[7rj]. Then, A stands for any multiset of such substitutions 6. Similarly, 
let -0 be a (closed) substitution on domain rv[Jj] tt) rv[7rj]. Moreover, for any such ip, let 
ipT^. be ip \ rv[7ri] (the restriction of ip on domain rv[7rj]), and ipj. be ip \ rv[Jj]. Because 
rv[7rj] n rv[Jj] = 0, the substitution ip is the sum of ipT^. and ipj., written ip = ipj^^ l+l ijjj^, and 
we further require ipT^. oipj. = ^j. 1+) Then, ^ is any multiset of such substitutions tp. 
Finally, C/ is a multiset of elements from 

Intuitively, we use Q and Q to bridge the differences caused by D and Yx{D). More 
specifically: a message x{ni5) may be forwarded to x^r^si'^^S) by the dispatcher in Yx{D); 
furthermore, if a guarded process Qiip is triggered from D, then from Yx{D), we have the 
corresponding guarded process match TTi^j^. with vTj — > Qiipj^ triggered; finally, a message 
on channel x with a non-matching content, that is from K, will be eaten by Yx{D). 

We analyze the following three aspects of IZ: closure by (closed) evaluation contexts; 
reduction bisimulation; and preservation of barbs. 

• 7^ is closed by closed evaluation contexts up to = [i.e. Lemma LS.lll fl)). For any closed 
evaluation context E[-], we have: 

£;[def D in (P & Q)] = def D or D' in {P &i P' k Q) 

E[deiYxiD) in {P &cQ)] = def YxiD) or D' in {PkP'kQ) 

where clv[i:»] n dv[L»'] = 0, so that Yx{D) or D' = Yx{D or D'). Therefore, we have 
£;[def D in (P&Q)] =n= E[dei Yx{D) in (P&Q)]. 

• 7^ is a reduction bisimulation (i.e. a special case of Lemma l3. 111 (2) because the identity 
in included in =). We only detail the nontrivial cases. 
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def D in (P&iQ) def K,(Z)) in (P & Q) 

Ji'qhx{-Ki5) ^^^^^^^^^^ Ji-qHixijiiS) 

\ 

match niS with tt^ QiT] 




Figure 5: Reduction chasing in case "go through" 

(1) If there is a message x{'Ki5') in P, the right part can forward it to a message XT^.^i{'Ki5') 
by the dispatcher in Yx{D). This reduction is simulated in the left part by no reduc- 
tion, and we add the new substitution 5' into A. 

(2) Similarly, if there is a message x{u') in P, for some n' S the right part can eat the 
message by the dispatcher in Yx{D). This reduction is simulated by no reduction in 
the left part and we add u' into U . 

(3) If a reduction according to the reaction rule Jihx{'ni) \> Qi consumes a molecule 
JiTj ^x{'Ki5) in the left part, for some (5 € A {i.e. x{iTiS) occurs in Q) and J^r/ from P, 
with dom(r/) = rv[Ji]; it can be simulated by consuming Ji?/ &: ^^^^(vrjfj) in the right 

part, using the corresponding reaction rule Jj &( \J x^.{zi))\>maitc\\ Zi with VTj — > 

Qi, because Xt^^s G {x^j^ \ j G -^(ttj)} (Lemma l7.3j) . The derivatives are still in TZ, with 
A shrinking to A \ {5}, and ^ expanding to ^ U {?] l±l 5}. We assume a-conversion 
when necessary to guarantee 5 or] = ij ktl 5. Vice versa. 

(4) Similar to the previous case but this time the left part consumes a molecule J^r/ &; x(7rj(5') , 
where 6' is not from A. Then, the right part simulates this reduction by first for- 
warding the message x{TTi5') to the message XT^^y{'Ki5') as in case|?2l then consuming 
the molecule JiTj ^x.j^^s'i'^i^')- ^ expands to ^ U {ry l±) (5'}. 

(5) The match 'Kiip-wi with tTj — Qiipj^ in Q of the right part can be reduced to 
(QiV'Ji )V'7ri by the semantic rule Match. Because we have ip^^.oipj. = ipj. l±l 
the result of the reduction equals to Qi(V'Ji tt'V'Tri)) that is Qitp. This reduction is 
simulated by no reduction in the left part. However, the process P becomes P & Qi^J, 
and ^ shrinks to ^' \ 

(6) If a reduction involves Qiip from Q of the left part, for some E ^, it can be simulated 
by first reducing the correspondent match TTiip.,^. with tTj Qi^pJi from Q into Qiip 
as in the previous case. 

Figure IS] summarizes the various cases we just examined, where thick lines express the 
TZ relation. 
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def D in (P&Q) def in (P & Q) 

\ 

match TT-iS with tt^ — > QiT] 

i 

Figure 6: Reduction chasing in case "jump" 

TZ preserves barbs {i.e. Lemma 13.111 (3)). We demonstrate def D in {P^Q) ly =^ 
def Yx{D) in {P ^Q) il-y and vice versa. We distinguish the cases that make def D in 

hold. 

(1) Q iy. We have y ^ 6v[D]. Because all variables in dv[Yx{D)] \ 6v[D] are fresh, we 
also have y ^ d\/[Yx{D)]. According to the structure of Q, we must have Qiip [y 
for some V £ Then in Q, we have match TTji/^^. with vr^ Qii^Ji reduces 
to and Qi^i [y. That is, (match vrj'^^. vi^ith vTj —> QiipjJ ij-y, i.e. Q ij-y, i.e. 
defY,{D) in {PkQ)i^y. 

(2) Ply. Obvious. 

The proof of the other direction, i.e. def Y^{D) in {P&iQ)iy =^ def D in (P&Q)JJ.y, 
is obvious since the only case for def Yx{D) in (P&: Q) ly is when P [y 



Following the analysis above, 7^ is a weak barbed congruence up to =. By Lemma f3.12| 
we have 7?. is a weak barbed congruence. 

Let A, ^ and U be empty sets. We have the two processes of Lemma 17.41 satisfy 
relation TZ, hence ~. That is, we proved that Lemma 17.41 holds for case "go through". 

Case "jump". We build another relation TZ, with Q and Q defined as follows: 

Q = JJ^^^ match TTiV'TTi with VTj QiipJi 
and we summarize the property of reduction bisimulation by the diagram of Figure El D 



8. Implementing applied join 

We carried out the practical implementation work of the applied join calculus as an 
extension of the JoCaml system. The extended system is publicly released [HJ. The release 
includes a tutorial that makes extensive use of algebraic patterns in join patterns. In this 
section, we first sketch out the structure of the extended JoCaml compiler, pointing out 
where the transformation should take place. Then some optimizations of our algorithm Y^ 
are reported. 
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8.1. Front end of the (extended) JoCaml compiler. The JoCaml compiler is an ex- 
tension of the OCaml compiler, as the JoCaml language is an extension of the OCaml 
language. Extensions are confined to the first four phases of the compiler. 

More precisely, there are additional tokens in the lexer (such as the keyword def). 
Then, all the constructs of Figure H are parsed and rendered as specific constructs in the 
abstract syntax tree. Typed syntax undergoes a similar extension. Amongst those first 
three compiler phases, only the typer significantly differs from the original OCaml compiler, 
since the JoCaml compiler has to deal with the specific rules for typing the join calculus 
polymorphically |18j . Finally, the typed syntax is translated to lambda- code, which basically 
is A-calculus enriched with primitive types and calls to the runtime library. All constructs 
specific to JoCaml disappear, being replaced by calls to specific primitives in a "Join" 
library, built on top of one of the OCaml thread libraries. In the following, we denote as 
"the JoCaml runtime" , the ordinary (thread aware) OCaml runtime, plus the thread library, 
plus the Join library. To summarize, extending the OCaml system to the JoCaml system 
amounts to modifying the front end of the compiler, and to writing the Join library. 
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Figure 7: The extended JoCaml compiler front end 

Extending JoCaml to handle pattern arguments in join definitions requires further mod- 
ifications. Figure [7| shows the structure of the extended JoCaml compiler. With respect to 
plain JoCaml (without algebraic pattern matching in join definitions), the parser and the 
typer have to be modified to take pattern arguments in channel definitions into account. 
However these extensions are mechanical. The critical modification manifests itself as an 
extra sub-phase (enclosed in the dashed polygon) between the typing phase and the trans- 
lation phase. Not surprisingly, the additional phase carries out the transformation from 
extended join definitions to plain ones, by implementing the compilation scheme [[•]] of Sec- 
tional Once this new transformation is performed, all join definitions in the typed trees are 
plain ones (without pattern arguments). Then, the translator to lambda-code and, more 
importantly, the JoCaml runtime system need not be changed, with respect to the ones of 
the original JoCaml system. 

We in fact also slightly extended the translator, for the sake of performing a few opti- 
mizations (see Section f8.2|) and of avoiding excessive duplications of guarded processes (see 
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Section IHiHl)- The optimizations we perform make use of the sophisticated pattern matching 
compiler and analyzer that are already present in the standard OCaml compiler. 

8.2. Matching optimizations. 

8.2.1. Avoiding redundant matchings. As discussed at the end of Section|ll the compilation 
introduces redundant matchings. For instance, in the stack example, we get: 

def pop{r) & State x:-xs{z) > match z with x::xs r{x) & State{xs) 

or State{y) > match y with 

I _: :_ ^ State x.,,xs{y) 

A pop operation apparently involves matching the State argument twice: once in the dis- 
patcher, to select the appropriate forwarding channel State x-.-.xs-, and again in the reaction 
rule, to perform the bindings of variables x and xs to the head and tail of the cons-cell z. 

However, by construction, the value of argument z is guaranteed to be an instance 
of the pattern x: :xs. This remark is general (see Lemma l7.3|) : for any matching match 
Zi with TTj Qi introduced in reaction rules by Step 5 of algorithm 1^, the value of Zi 
always matches the pattern tTj. In other words, the matching match zi with tTj Qi never 
fails, hence no test need to be performed at all. As a consequence, in the case of the pattern 
x: -.xs, we aim at getting the the following lambda-code0 

let x = field 2; in 

let xs = field 1 z in 

Primitives field 2;" and field 1 2" extract the head and tail from the cons-cell z. 

The requirement is then to write a specific matching compiler that does not issue tests 
when test outputs can be predicted at compile time. In fact, such a matching compiler 
is already present in the OCaml compiler: as it stands, the optimizing pattern matching 
compiler of [22] can output such code, provided it is informed that the compiled matching 
has only one clause and never fails, which is exactly the case for all the matchings match 
Zi with vTj — > Qi introduced in reaction rules by Step 5 of algorithm Y^. Incidentally, the 
condition "the matching can never fail" is expressed simply as "the matching is exhaustive" . 
We also rely on a later phase of the OCaml compiler to inline let-bindings when appropriate. 

As a final remark, it is worth observing that, when the original pattern does not contain 
variables, the compilation of match Zi with vrj — > • • • yields no code: neither test, nor 
binding. 

8.2.2. Avoiding useless forwarding channels. Simple analysis of the dispatcher matching 
enables use to spare some of the forwarding channels. Let us first re-consider the example 
of the complete stack. Our transformer Y applied to channel State yields the following 
dispatcher: 



'In examples, we show lambda-code as OCaml code, enriched with a few primitives. 
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or State{z) > match z with 

I —> Statei{z) 
I [0] State2{z) 
I _::_::_—> Statez{z) 
I : : _ — Statei{z) 
I [_] State^[z) 
I _ : : _ — 5- State%{z) 
I [] ^ State'j{z) 
I _ — > Statesiz) 

In the matching above, some clauses are never matched at runtime. For instance, the 
last clause "_ — > State%{z)" is useless, because of the two immediately preceding clauses 
"_: ..." and "[] ..." that obviously match all the lists. As a consequence, the 
forwarding channel States never carries any message hence it is also useless. Similarly, 
channels State^ and channel State^ are useless. We can optimize by removing both the 
useless clauses from the dispatcher and all occurrences of useless channels from the rewritten 
join patterns. 

To summarize, by applying the optimizations discussed so far, the stack example after 
compilation looks as follows: 

def push{v) &: [Statei{zi) or State2{zi) or State^{zi) or State^{zi) or Statej{zi)) 

> State {v : : zi) 

or pop{r) & {Statei{z2) or State2{zi) or States{zi) or State^{zi)) 

> r{field Z2) & State{field 1 Z2) 
or insert{n) & {Statei{z^) or State2{z^)) 

> State{0 : : n : : field 1 2:3) 

or last{r) &: {State2{zi) or State^{z4)) 

> let X = field 24 in r{x) &: State{ [x] ) 
or swap{) & (Statei^z^) or State^^z^)) 

> let m = field 1 25 in State[field m: :/ieZ(i 25: :/ieW 1 m) 
or pause{r) &: Statej{zQ) > r() 

or resume{r) > State{['\) & r() 
or State{z) > match z with 

I 0: :_: :_ ^ Statei{z) 

I [0] -> State2{z) 

I :_ State3{z) 

I [_] —>■ State^{z) 

I [] State-j{z) 

Thanks to the optimization, three cases are spared from the dispatcher, three channels are 
not allocated, and the size of the or join-patterns decrease significantly. 

To integrate this optimization into the implementation, we modify the algorithm 1^, as 
regards dispatcher construction (Step 4) and rewriting of reaction rules (Step 5). In Step 
4, after the topological sort, we check the usefulness of each vertex. More specifically, to 
check whether vertex Vk is useful or not, with respect to the preceding vertices ui, . . . , Vk-i 
in the topological order, we check the usefulness of pattern ooj. with respect to patterns 
ui, . . . , Wfc-i, where is the annotation pattern of vertex vi. For that purpose, we use 
the standard usefulness checker of OCaml [30], of which we present a simplified version in 
Section [2.11 Then, in Step 5 of the algorithm we retain only the f^'s that are useful. 
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8.3. Compiling or in join patterns. The compilation scheme [[•] introduces disjunctive 
composition into join patterns, a construct that JoCaml did not support before the intro- 
duction of pattern argument in join definitions. In this section, we describe our extensions 
to the JoCaml compiler so as to integrate this new feature. 

When we introduced or in join patterns, we claimed that it is syntactic sugar. That is, 
we define this new construct by distributing & over or, until or reaches the reaction rule 
level, where we finally duplicate the reaction rules themselves. 

( Ji or J2 or • • • or J„) > P =^ ( Ji > P) or ( J2 l> P) or • • • or ( J„ > P) 

The whole process of distributing &: over or and of duplicating the rules can be summarized 
as "expansion of or in join patterns". 

It is not difficult to see that the above mentioned expansion easily produces an expo- 
nential number of reaction rules. For instance, consider the definition: 

def a^(true) > Pi or a^(true) > P2 • • • or a" (true) > P.„ 

or a^{_) k a2(_) & • • • & a"(_) > Pq 
For each channel a* there are two forwarding channels fltrue ■ As a consequence, after 

rewriting, the last reaction rule from the definition above becomes: 

or (airue(^i) or ^^(^i)) & (atrue(^2) or a^(z2)) & • • • & (atrue(^n) or a^(z„)) > Po 
And the expansion of or in join patterns finally yields 2" reaction rules. 

The extended JoCaml compiler indeed performs the expansion of or in join patterns as 
sketched above, except for one point: the guarded processes (Pq in example) is not dupli- 
cated. Instead, guarded processes are compiled into (lambda-code) closures and duplication 
of guarded processes is performed by duplication of pointers to those closures. 

We will illustrate two successive refinements of the idea of sharing guarded processes. 
But before that, let us first examine how guarded processes are compiled and triggered in 
the general case. 

def a{x) & b{y) > P 
or a{x) & c{y) > Q 

The above join definition defines three channels organized in two reaction rules. Target 
lambda-code can be sketched as follows: 

1 let jdef = 



2 

3 let g^a^h} = fun jdef 

4 let X = Join. get-queue jdef ia in 

5 let y = Join.get_queue jdef ib in 

6 Join. unlock jdef; 

7 Join. spawn (fun () —i- [PJa) in 

8 let = fun jdef 

9 let X = Join.get_queue jdef ia in 

10 let y = Join.get_queue jdef ic in 

11 Join. unlock jdef; 

12 Join. spawn (fun () —i- lQ}x) in 
13 



The presented lambda-code only describes the compilation of guarded processes to closures 
g{a,b} Slid g[a,c}- Those guarded closures are subparts of the complete compilation of the 
join definition. They appear as local bindings in the more complete definition jdef, which 
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is not shown. We refer to [2S| for a full explanation about how the JoCaml compiler deals 
with join definitions and guarded processes. Nevertheless, we give a brief description, based 
upon the example. Join definitions are compiled into vector-like structures, and channels 
are pairs of a pointer to such a structure and of a channel slot (written ia etc. above). 
Channel slots are small integers. Here, we assume ia to be 0, if, to be 1, and ic to be 2. 
Based upon channel slots, join patterns are compiled into bitsets. In this example, we 
have 110 for pattern "0(2;) & b{y)" and 101 for "0(2;) & c{yy\ The join definition runtime 
structure holds a list of pairs made of such a bitset and of a pointer to a guarded closure 
( [(110,5{a 6} ; (101,g|a c})] in our example). This join matching list can be seen as the 
result of reaction rules compilation. The definition structure also holds a mutex, an array 
of queues (indexed by channel slots), and an internal bitset that describes the current status 
of queues. In response to message sending over a channel, specific code from the Join library 
first locks the mutex, alters the internal bitset, stores the message in the appropriate queue, 
and then attempt a match. In case a match is found, the corresponding closure {g{a,b} or 
9{a,c} above) is called, with the definition itself as an argument. 

Notice that the closures g{a,b} or g{a,c} have the responsibility to bind formal arguments 
X and y to the appropriate actual arguments, which are extracted from the appropriate 
queues (lines HHSI and l9HT?1|) , and to release the mutex (lines El and [TT|) . The guarded 
process is finally triggered by the means of the primitive Join. spawn that takes a closure 
as argument (lines 171 and I12|) and creates a new thread to run that closure. Here, [[-PJa and 
IQJx represent the compilation to lambda-code of P and Q respectively. It is to be noticed 
that formal parameters may occur free in P and Q. 

Now let us consider the compilation of join definitions with or in their join patterns, 
such as this one: def a{x) & {b{y) or c{y)) > P. Target lambda-code can be sketched as 
follows: 

1 let jdef = 



2 

3 let p = fun jdef x y ^ 

4 Join. unlock jdef] 

5 Join. spawn (fun () — > [PJa) in 

6 let g{a,b} = fun jdef 

7 let X = Join. get-queue jdef ia in 

8 let y = Join.get_queue jdef ib in 

9 p jdef X y in 

10 let g^a,c} = fun jdef 

11 let X = Join.get_queue jdef ia in 

12 let y = Join.get_queue jdef ic in 

13 p jdef X y in 

14 



As a consequence of the expansion of the disjunctive pattern "6(1/) or c{y)" , the join match- 
ing list is [(110,5|a b} ; (101,5{a c})] ) like in the previous example. The two guarded closures 
g{a,b} g{a,c} different, because the value bound to the formal argument y has to be 
extracted either from the queue of channel b or from the queue of channel c, depending 
upon the matched join pattern being "a(a;) & b{y)" or "0(2;) & c{y)" . However, the task 
of unlocking the mutex and of triggering the process P is common to both and is performed 
by a third closure p (lines IHH^ , which is called by the two guarded closures g^a,b} and g^a,c} 
at lines PH and 1131 respectivelv. As a result, duplication of most of the guarded process code 
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is avoided and a reasonable amount of sharing is achieved. One should observe that the 
interface between the library code that performs join matching and the guarded closures is 
preserved: guarded closures are still functions that take a join structure as argument. 

It is in fact possible for the compiler to completely share guarded closures between 
reactions rules that originate from or pattern expansion. But then, guarded closure code 
must be abstracted further with respect to the exact join pattern that is matched. The 
idea of dictionary can be used for this purpose. A dictionary is an array built by the 
compiler. Dictionaries represent mappings from formal parameters to channel slots and 
the compilation of a join pattern now yields a pair of a bitset and of a dictionary. More 
significantly, disjunctive patterns are now compiled into a series of such pairs. For instance, 
the pattern "a(a;) Sz {h{y) or c{y)y^ is now compiled into the two pairs "(110, [|0 ; 1|])" 
and "(101, [|0 ; 2|])", where for instance the dictionary component "[|0 ; 2|]" expresses 
that the formal parameters x and y are to be bound to messages sent on channels a (at slot 
0) and c (at slot 2) respectively. The compiler then generates guarded closures abstracted 
with respect to dictionaries. 

let g{a,{b\c)} = fun jdef diet 

let X = Join.get_queue jdef {field diet) in 

let y = Join. get-queue jdef {field 1 diet) in 

Join. unlock jdef; 

Join. spawn (fun () — > \P\\) 
where field i diet" returns the ith element of the dictionary ^^dict" . The join matching 
list now becomes the following list of triples: 

[ (110, [|0 ; l|],5{a,{6|c)}) ; (101, [|0 ; 2|] ] 
In case a join-pattern bitset is matched, the corresponding closure in the triple is called, 
with the join definition structure and additionally the dictionary in the triple as arguments. 

Adding one dictionary component is the price we should pay to achieve complete sharing 
of guarded closures. However, such a dictionary is not necessary for reaction rules whose 
pattern is not disjunctive. In that case, the compiler can avoid the extra "/zeW i diet" 
calls and replace them by the appropriate channel slots, which are known at compile time. 
However, for the sake of keeping an uniform structure of the join matching list, guarded 
closures should always accept the extra "rficf argument, even when not needed. A simple 
solution is to consider a dummy dictionary, to be passed to such guarded closures that do 
not need a dictionary. 

The current implementation of JoCaml does not use dictionaries. We are still lacking 
experience to be able to assert whether they are worth the price or not. 

9. Related work 

Applied join is "impure" in the sense of Abadi and Fournet's applied vr-calculus 
We too extend an archetypal name passing calculus with pragmatic constructs, in order 
to provide a full semantics that handles realistic language features without cumbersome 
encodings. It is worth noticing that like in PP, we distinguish between variables and names 
(only variables of channel type are treated as names), a distinction that is seldom made 
in pure calculi. Since we aim to prove a program transformation correct, we define the 
equivalence on open terms, those that contain free variables. Abadi and Fournet are able 
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to require their terms to have no free variables, since their goal is to prove properties of 
program execution, namely the correctness of security protocols. 

Our compilation scheme presented in Section [5] can be seen as the combination of 
two basic steps: refining channels and forwarding by dispatcher. The desired property 
of the forwarding behavior (Lemma 17.3(1 constitutes the core of the correctness proof of 
the compilation scheme, which essentially stems from pattern matching theory. There are 
other work that perform the formal treatments of forwarders, for instance [321 119j . but in 
different contexts. Our forwarder demultiplexes messages into separate channels according 
to the pattern of the messages, while |321ll9j use plain channel-to-channel linear forwarders 
to achieve the locality property, i.e. reception on a given channel takes place on an unique 
site. It is to be noticed that the equivalence proof of JH] is with respect to ordinary 
barbed congruence and by the means of a labelled transition system. Yet another example 
is the correctness proof of the compilation of join patterns to smooth orchestrators in |24j . 
The compilation of [21] is less involved than ours since it basically amounts to inserting 
forwarders. 

We established the correctness of our compilation scheme by showing the programs be- 
fore and after compilation to be behavioral congruent. It is usual practice in the literature 
to prove correctness of program transformations by showing semantics preservation. 
is a survey). Here, variations are numerous: they consist in different connections between 
source and target formalism (two independent languages, or with the target being a sub-set 
of the source), different semantics (denotational vs. operational), different equivalence rela- 
tions (observational equivalence, refinement relation, simulation, etc.), and different settings 
(sequential, concurrent, parallel, object-oriented, etc.). Consequently, proof techniques also 
differ. For example, recent work of Blazy et al. [H] reports the formal verification of a C 
compiler front-end in the Coq proof assistant. It handles two independent source and target 
languages, both with big-step operational semantics. The major difficulty of the correct- 
ness proof resides in relating the different memory states and evaluation environments of 
the two languages. A simulation relation is demonstrated from target code to source code 
by induction on evaluation derivation and case study over the last applied evaluation rule. 
Closer to our work, shows the correctness of an optimizing translation that compiles 
away pattern matching in Scala. Proof techniques analogous with ours are applied, i.e. they 
also tackle contexts explicitly by proving congruence and define observational equivalence 
on open terms based on the one between closed terms and closing up by substitutions. 
Moreover, specific to its extractor-base pattern matching, extractors are required to always 
terminate without exception in order to achieve the correctness. 

We now review some programming languages that support concurrency and examine 
how our work can be related to those. Languages whose model for concurrency directly 
stems from the join calculus should benefit from our work. More precisely, if a language al- 
ready offers a la ML pattern matching and join definitions, then its authors can implement 
our ideas in their framework, and their implementation effort would be small. An early 
example of a language based upon the join calculus is Funnel ^H]- Funnel later evolved 
into Scala where a la ML pattern matching is supported and join style concurrency 
is provided in terms of a library [^. Another similar work is j42j . which introduces join 
style concurrency in Haskell. We believe that extending the two settings above with alge- 
braic patterns as formal arguments can be made by direct application of our techniques. 
Smooth orchestrators |21I differ from join definitions in rather subtle ways: an orchestrator 
is syntactically similar to a join definition and can be seen as defining competing reaction 
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rules; however, (1) once a reaction rule of an orchestrator is selected and continuation fired, 
the whole orchestrator (together with other non-selected competing rules) gets expired and 
discarded; and (2) the definitions of channels and of orchestrators that synchronize them 
are separated. Point (2) above is quite subtle: one can orchestrate receptions on channels 
whose definitions are unknown, provided all the orchestrated channels are defined on the 
same site. Nevertheless, orchestrators are controlled by finite automata that extends the 
ones of for join definitions. Thus, the adaptation of our techniques to orchestrators 
looks feasible. 

In addition, there is a sustained interest in integrating join calculus into object-oriented 
languages : polyphonic C" and its successor Co; [7j for C"; and Join Java for Java. 
Unfortunately, the issue here is the lack of pattern matching, which neither C" nor Java 
offers. A detailed discussion on the introduction of a la ML pattern matching in object- 
oriented languages would be out of scope. Briefly, proposed solutions are either by the means 
of preprocessing |H| , or by tighter language integration jl4l I39j . As our compilation scheme 
requires precise information on pattern semantics {e.g. to decide the precision relation 
we think that solutions of the second kind would facilitate the extension of the introduced 
pattern matching to join patterns. 

Erlang |3| features both pattern matching and concurrency. However, concurrency in 
Erlang is based upon the actor model [2DI2]- In this model, messages are sent to actors 
and actors manage a queue of messages. Moreover, the reception behavior of an actor can 
be specified by the receive m construct. This construct is similar to ML pattern matching 
match V with m, except for the value matched v, which is left implicit. The semantics of 
receive m can be described as follows: attempt a match in the actor's queue, scanning it 
from the oldest to the most recent message, stopping when a match is found. This simple 
combination of message passing and pattern matching proves convenient, as witnessed by 
the success of Erlang. However, Erlang in general misses a simple and efficient handling 
of synchronization between actors as join patterns offer. Lacking necessary knowledge of 
Erlang internals, it is difficult for us to assess whether the selection of messages from actors 
queues can benefit from our techniques or not. In any case, difference in semantics is 
outstanding and we conjecture that an adaption of our technique would not be immediate. 
In particular, the existence of one message queue per receiving agent is central to Erlang 
model, while a join definition naturally handles several message queues. 

Finally, we discuss the transplantation of our compilation scheme to a language whose 
semantics for concurrency is based upon the original vr-calculus of [HSl; like for instance 
Pict PiDuce ^Uj without orchestrators. Such a task is apparently impossible. 

Namely, on the one hand, we propose a compilation scheme, and we thus need to isolate 
all the instances of reception on a given channel from program source ; while, on the other 
hand, the vr-calculus features unrestricted input capability. More precisely, in the vr-calculus, 
any process that knows of some channel x can input on it. As a channel name x can be 
passed via messages, reception on x may occur anywhere. The join calculus originates from 
a radical solution to the distributed implementation issue: channels and reception behaviors 
are defined by a synthetic construct, and input on channels cannot occur anywhere else. 
However, there are other solutions that retain the vr-calculus as a basis while restricting input 
capability, such as the localized vr-calculus [35] • Moreover, the located channels of Nomadic 
Pict 133] allows to lift such solutions to a distributed setting. Given such frameworks, we 
shall assume that all receptors on a given channel are known statically. Then, we can 
extend the input construct x{y).P as x(vr).P, where vr is pattern, and expect to be able to 
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translate this extended language into ordinary vr-calculus. In that process, we see at least 
one additional complication. Let vri and 7r2 be two patterns that are compatible {i.e. that 
have instances in common), and let us consider the following program, an analog of the 
simple examples of Section^ 

x(7ri).Pi I x{tT2).P2 

The above process significantly differs from a join definition, since a successful input does 
not discard the other input. A tentative translation in the spirit of ours would be the 
parallel composition of a dispatcher: 

! 2;(z).match z with vri |7r2 ^ XTntTral-z) I t^i — > | ^2 x-j^^iz) 

and of the following process: 

)-Ql) I {x.„^U2{^)-Q2+Xt,^{z).Q2) 

Where Qi is match z with vTj Pi, and "+" is internal choice that we use here to ex- 
press input-guarded choice. Thus, we need input-guarded choice. This is a noticeable 
complication, even though input-guarded choice can be expressed in the 7r-calculus without 
choice j35j . Another concern is the usage of the replication operator "!" in the dispatcher. 
Clearly, the adaptation of our technique to a 7r-calculus setting is not immediate. 

10. Conclusion and future work 

This paper is part of our effort to develop a practical concurrent programming language 
with firm semantical foundations. In our opinion, a programming language is more than 
an accumulation of features. That is, features interact sometimes in unexpected ways, es- 
pecially when intimately entwined. Here, we have studied the interaction between pattern 
matching and concurrency. The framework we have used was the applied join calculus — 
an extension of the join calculus with algebraic data types. Applied join inherits its ca- 
pabilities of communication and concurrency from join and supports value passing. More 
significantly, it allows algebraic pattern matching in both formal arguments of channel defi- 
nitions and guarded processes. Compared with join, applied join provides a more convenient 
(or "pragmatic"), precise and realistic language model to programmers. From that perspec- 
tive, pattern matching and join calculus appear to live well together, with mutual benefits. 
The result of this work reinforces our interest in using a la ML pattern matching as a general 
purpose programming paradigm, and join calculus as the basic paradigm for concurrency. 

Exploiting the fact that JoCaml already had an efficient implementation for both ML 
pattern matching and join primitives, we have designed the implementation of applied join 
as defining a practical compilation scheme that transforms extended join definitions into 
ordinary ones plus ML pattern matching. We have solved the non-determinism problem 
during the design of this compilation scheme. Moreover, we have actually integrated it 
into the JoCaml system with several optimizations. It is worth observing that a direct 
implementation of extended join-pattern matching at the runtime level would significantly 
complicate the management of message queues, which would then need to be scanned in 
search of matching messages before consuming them. As we remarked, our compilation 
technique may yield code of exponential size. However, we expect such blowup not to occur 
in practice, an expectation which is apparently confirmed by our preliminary experiments 
in the JoCaml system. Should this prove wrong in the future, we could face the issue in 
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two manners : either complicate the runtime system as sketched above, or design a direct 
implementation of or in join patterns. 

A theory of process equivalence has also been developed in applied join in order to assess 
the correctness of our compilation scheme. In archetypal name passing calculi, where every 
free variable is of channel type, it is sufficient to only consider terms closed in our sense, i.e. 
terms without free variables of non-channel type, when defining equivalence relations. By 
contrast, applied join supports real values and its static transformations should apply to 
open processes with free variables of non-channel type. To tackle this problem, we have first 
defined a weak barbed congruence to express the equivalence of two closed processes, then 
we have lifted the equivalence relation to open processes by closing up by all substitutions. 
The resulting relation is called "open equivalence". We have demonstrated it is also a full 
congruence and have proved our compilation scheme correct by showing that the processes 
before and after transformation are open equivalent. The proof technique we have used, 
which can be summarized as "full abstraction" , stems from pattern matching theory and the 
fact that inserting an internal forwarding step in communications does not change process 
behavior. 

In previous work, we have designed an object-oriented extension of the join calcu- 
lus ^31231201 5 which appeared to be more difficult. The difficulties reside in the refine- 
ment of the synchronization behavior of objects by using the inheritance paradigm. We 
solved the problem by designing a delicate way of rewriting join patterns at the class level. 
However, the introduction of algebraic patterns in join patterns impacts this class-rewriting 
mechanism. The interaction is not immediately clear. Up to now, we are aware of no 
object-oriented language where the formal arguments of methods can be patterns. We thus 
plan to investigate such a combination of pattern matching and inheritance, both at the 
calculus and language level. 

Another interesting future work would be to extend our framework with more sophis- 
ticated patterns for XML data. As a matter of fact, the authors of Scala have already 
extended the notion of pattern matching to the processing of XML data with the help of 
regular expression patterns (a similar system is PiDuce ^Hl)- Their extension makes Scala 
suitable for developing web service applications. Our model of pattern matching in join cal- 
culus works with general algebraic data types. At the moment, we do not see any particular 
barrier that prevent our model from also working with XML trees. 
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